Overview

Dataset statistics

Number of variables26
Number of observations1605
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory326.1 KiB
Average record size in memory208.1 B

Variable types

Numeric25
Categorical1

Warnings

depth_midpoint_previous is highly correlated with depth_midpoint_currentHigh correlation
depth_midpoint_current is highly correlated with depth_midpoint_previousHigh correlation
Unnamed: 0 is uniformly distributed Uniform
Unnamed: 0 has unique values Unique
slope has 351 (21.9%) zeros Zeros
aspect has 24 (1.5%) zeros Zeros
cwd has 23 (1.4%) zeros Zeros

Reproduction

Analysis started2021-03-10 16:45:50.759744
Analysis finished2021-03-10 16:47:39.439336
Duration1 minute and 48.68 seconds
Software versionpandas-profiling v2.12.0
Download configurationconfig.yaml

Variables

Unnamed: 0
Real number (ℝ≥0)

UNIFORM
UNIQUE

Distinct1605
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean802
Minimum0
Maximum1604
Zeros1
Zeros (%)0.1%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum0
5-th percentile80.2
Q1401
median802
Q31203
95-th percentile1523.8
Maximum1604
Range1604
Interquartile range (IQR)802

Descriptive statistics

Standard deviation463.4679061
Coefficient of variation (CV)0.5778901573
Kurtosis-1.2
Mean802
Median Absolute Deviation (MAD)401
Skewness0
Sum1287210
Variance214802.5
MonotocityStrictly increasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
16031
 
0.1%
10741
 
0.1%
10541
 
0.1%
10561
 
0.1%
10581
 
0.1%
10601
 
0.1%
10621
 
0.1%
10641
 
0.1%
10661
 
0.1%
10681
 
0.1%
Other values (1595)1595
99.4%
ValueCountFrequency (%)
01
0.1%
11
0.1%
21
0.1%
31
0.1%
41
0.1%
ValueCountFrequency (%)
16041
0.1%
16031
0.1%
16021
0.1%
16011
0.1%
16001
0.1%

temp_c
Real number (ℝ)

Distinct86
Distinct (%)5.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.66483074
Minimum-1
Maximum29
Zeros0
Zeros (%)0.0%
Negative4
Negative (%)0.2%
Memory size12.7 KiB

Quantile statistics

Minimum-1
5-th percentile7
Q19.5
median15
Q322
95-th percentile26
Maximum29
Range30
Interquartile range (IQR)12.5

Descriptive statistics

Standard deviation6.517625506
Coefficient of variation (CV)0.416067407
Kurtosis-1.172427871
Mean15.66483074
Median Absolute Deviation (MAD)5.9
Skewness0.1537982391
Sum25142.05333
Variance42.47944224
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10125
 
7.8%
13.9108
 
6.7%
22101
 
6.3%
2398
 
6.1%
1883
 
5.2%
975
 
4.7%
2460
 
3.7%
2657
 
3.6%
9.356
 
3.5%
1752
 
3.2%
Other values (76)790
49.2%
ValueCountFrequency (%)
-14
 
0.2%
1.25
 
0.3%
38
 
0.5%
523
1.4%
5.62
 
0.1%
ValueCountFrequency (%)
292
 
0.1%
28.58
0.5%
282
 
0.1%
27.95
0.3%
273
 
0.2%

precipitation_mm
Real number (ℝ≥0)

Distinct193
Distinct (%)12.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1061.195484
Minimum1.642
Maximum4065
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum1.642
5-th percentile353
Q1605
median948
Q31462
95-th percentile2090
Maximum4065
Range4063.358
Interquartile range (IQR)857

Descriptive statistics

Standard deviation602.1028798
Coefficient of variation (CV)0.5673816831
Kurtosis2.012908021
Mean1061.195484
Median Absolute Deviation (MAD)378
Skewness1.143206291
Sum1703218.752
Variance362527.8779
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2090108
 
6.7%
150093
 
5.8%
71956
 
3.5%
146248
 
3.0%
71445
 
2.8%
42442
 
2.6%
73542
 
2.6%
260030
 
1.9%
155327
 
1.7%
57827
 
1.7%
Other values (183)1087
67.7%
ValueCountFrequency (%)
1.6426
0.4%
7912
0.7%
13811
0.7%
1462
 
0.1%
1603
 
0.2%
ValueCountFrequency (%)
40656
 
0.4%
36702
 
0.1%
260030
1.9%
25006
 
0.4%
23703
 
0.2%

depth_midpoint_previous
Real number (ℝ≥0)

HIGH CORRELATION

Distinct58
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28.25875389
Minimum0.15
Maximum175
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum0.15
5-th percentile5
Q110
median15
Q340
95-th percentile75
Maximum175
Range174.85
Interquartile range (IQR)30

Descriptive statistics

Standard deviation26.46977508
Coefficient of variation (CV)0.9366929335
Kurtosis5.897472685
Mean28.25875389
Median Absolute Deviation (MAD)8.5
Skewness2.060267673
Sum45355.3
Variance700.6489928
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15413
25.7%
10121
 
7.5%
7.5116
 
7.2%
5110
 
6.9%
5096
 
6.0%
2585
 
5.3%
22.579
 
4.9%
4559
 
3.7%
7556
 
3.5%
4050
 
3.1%
Other values (48)420
26.2%
ValueCountFrequency (%)
0.151
 
0.1%
0.41
 
0.1%
0.751
 
0.1%
116
1.0%
1.510
0.6%
ValueCountFrequency (%)
1756
0.4%
1605
0.3%
1363
0.2%
1353
0.2%
1256
0.4%

depth_midpoint_current
Real number (ℝ≥0)

HIGH CORRELATION

Distinct55
Distinct (%)3.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean28.35781931
Minimum0.15
Maximum175
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum0.15
5-th percentile5
Q110
median15
Q340
95-th percentile75
Maximum175
Range174.85
Interquartile range (IQR)30

Descriptive statistics

Standard deviation26.40742133
Coefficient of variation (CV)0.9312218628
Kurtosis5.94758117
Mean28.35781931
Median Absolute Deviation (MAD)7.5
Skewness2.074106069
Sum45514.3
Variance697.3519011
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
15413
25.7%
10137
 
8.5%
7.5116
 
7.2%
5110
 
6.9%
5096
 
6.0%
2585
 
5.3%
22.579
 
4.9%
4559
 
3.7%
7558
 
3.6%
4050
 
3.1%
Other values (45)402
25.0%
ValueCountFrequency (%)
0.151
 
0.1%
0.41
 
0.1%
0.751
 
0.1%
1.510
0.6%
2.520
1.2%
ValueCountFrequency (%)
1756
0.4%
1605
0.3%
1363
0.2%
1353
0.2%
1256
0.4%

soc_g_kg_previous
Real number (ℝ≥0)

Distinct337
Distinct (%)21.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.62790322
Minimum1.1
Maximum78.7
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum1.1
5-th percentile3.6
Q16.956666667
median11.5
Q317.3
95-th percentile34.2
Maximum78.7
Range77.6
Interquartile range (IQR)10.34333333

Descriptive statistics

Standard deviation9.980985466
Coefficient of variation (CV)0.7323933334
Kurtosis7.715712168
Mean13.62790322
Median Absolute Deviation (MAD)4.78
Skewness2.213312396
Sum21872.78467
Variance99.62007088
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1497
 
6.0%
6.95666666779
 
4.9%
18.259
 
3.7%
4.243
 
2.7%
16.1933333339
 
2.4%
21.136
 
2.2%
11.3666666731
 
1.9%
2827
 
1.7%
9.11266666725
 
1.6%
10.1666666725
 
1.6%
Other values (327)1144
71.3%
ValueCountFrequency (%)
1.12
0.1%
1.21
 
0.1%
1.32
0.1%
1.54
0.2%
1.62
0.1%
ValueCountFrequency (%)
78.75
0.3%
65.51
 
0.1%
59.71
 
0.1%
51.92
 
0.1%
51.81
 
0.1%

soc_mg_ha_previous
Real number (ℝ≥0)

Distinct725
Distinct (%)45.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean68.4551838
Minimum0.6
Maximum1050
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum0.6
5-th percentile3.43
Q114.1
median33.33
Q369.51
95-th percentile233.36
Maximum1050
Range1049.4
Interquartile range (IQR)55.41

Descriptive statistics

Standard deviation129.2433062
Coefficient of variation (CV)1.887998819
Kurtosis29.40743579
Mean68.4551838
Median Absolute Deviation (MAD)22.13333333
Skewness5.013558208
Sum109870.57
Variance16703.83219
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.0233333346
 
2.9%
22.9139
 
2.4%
333.430
 
1.9%
436.7818
 
1.1%
95517
 
1.1%
2716
 
1.0%
38.0933333315
 
0.9%
37.3966666714
 
0.9%
2.7314
 
0.9%
4.5814
 
0.9%
Other values (715)1382
86.1%
ValueCountFrequency (%)
0.61
 
0.1%
12
 
0.1%
1.061
 
0.1%
1.11
 
0.1%
1.13666666712
0.7%
ValueCountFrequency (%)
10501
 
0.1%
9902
 
0.1%
95517
1.1%
8852
 
0.1%
6061
 
0.1%

latitud
Real number (ℝ)

Distinct207
Distinct (%)12.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean22.82641745
Minimum-37.62
Maximum64.33
Zeros0
Zeros (%)0.0%
Negative481
Negative (%)30.0%
Memory size12.7 KiB

Quantile statistics

Minimum-37.62
5-th percentile-32.76
Q1-8.03
median37.89
Q348.75
95-th percentile53.7
Maximum64.33
Range101.95
Interquartile range (IQR)56.78

Descriptive statistics

Standard deviation30.01959462
Coefficient of variation (CV)1.315125104
Kurtosis-1.176529709
Mean22.82641745
Median Absolute Deviation (MAD)13.54
Skewness-0.6649155977
Sum36636.4
Variance901.1760611
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
41.91108
 
6.7%
-33.8366
 
4.1%
49.856
 
3.5%
-18.4848
 
3.0%
48.7545
 
2.8%
29.7242
 
2.6%
-3.1230
 
1.9%
53.8227
 
1.7%
-19.7527
 
1.7%
44.9724
 
1.5%
Other values (197)1132
70.5%
ValueCountFrequency (%)
-37.6211
 
0.7%
-35.722
 
0.1%
-35.291
 
0.1%
-33.8366
4.1%
-32.765
 
0.3%
ValueCountFrequency (%)
64.331
 
0.1%
63.21
 
0.1%
58.731
 
0.1%
57.621
 
0.1%
56.8312
0.7%

longitud
Real number (ℝ)

Distinct217
Distinct (%)13.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-7.088870197
Minimum-122.49
Maximum175.92
Zeros0
Zeros (%)0.0%
Negative845
Negative (%)52.6%
Memory size12.7 KiB

Quantile statistics

Minimum-122.49
5-th percentile-97.934
Q1-49.14
median-1.2
Q314.62
95-th percentile104
Maximum175.92
Range298.41
Interquartile range (IQR)63.76

Descriptive statistics

Standard deviation56.77242343
Coefficient of variation (CV)-8.008670191
Kurtosis0.1307959161
Mean-7.088870197
Median Absolute Deviation (MAD)43.18
Skewness0.4648744747
Sum-11377.63667
Variance3223.108063
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
41.98108
 
6.7%
6.7274
 
4.6%
19.966
 
4.1%
-49.1448
 
3.0%
8.9245
 
2.8%
76.9742
 
2.6%
-60.0230
 
1.9%
12.0727
 
1.7%
-48.6427
 
1.7%
9.6824
 
1.5%
Other values (207)1114
69.4%
ValueCountFrequency (%)
-122.492
 
0.1%
-109.5712
0.7%
-108.755
0.3%
-106.354
 
0.2%
-104.85
0.3%
ValueCountFrequency (%)
175.9211
0.7%
144.451
 
0.1%
139.852
 
0.1%
1272
 
0.1%
1202
 
0.1%

elevation
Real number (ℝ)

Distinct191
Distinct (%)11.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean366.1750779
Minimum-3
Maximum2514
Zeros0
Zeros (%)0.0%
Negative4
Negative (%)0.2%
Memory size12.7 KiB

Quantile statistics

Minimum-3
5-th percentile18
Q1125
median246
Q3535
95-th percentile1134.4
Maximum2514
Range2517
Interquartile range (IQR)410

Descriptive statistics

Standard deviation375.7752878
Coefficient of variation (CV)1.02621754
Kurtosis5.252558044
Mean366.1750779
Median Absolute Deviation (MAD)199
Skewness2.03471172
Sum587711
Variance141207.0669
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
137112
 
7.0%
16266
 
4.1%
15056
 
3.5%
47651
 
3.2%
53548
 
3.0%
25342
 
2.6%
1039
 
2.4%
4732
 
2.0%
63627
 
1.7%
10324
 
1.5%
Other values (181)1108
69.0%
ValueCountFrequency (%)
-34
 
0.2%
39
 
0.6%
96
 
0.4%
1039
2.4%
112
 
0.1%
ValueCountFrequency (%)
25141
 
0.1%
195112
0.7%
19014
 
0.2%
184518
1.1%
17392
 
0.1%

slope
Real number (ℝ≥0)

ZEROS

Distinct10
Distinct (%)0.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean70.31294704
Minimum0
Maximum90
Zeros351
Zeros (%)21.9%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum0
5-th percentile0
Q189.97
median90
Q390
95-th percentile90
Maximum90
Range90
Interquartile range (IQR)0.03

Descriptive statistics

Standard deviation37.21135331
Coefficient of variation (CV)0.5292247712
Kurtosis-0.1441595302
Mean70.31294704
Median Absolute Deviation (MAD)0
Skewness-1.36235849
Sum112852.28
Variance1384.684815
MonotocityNot monotonic
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%)
90858
53.5%
0351
21.9%
89.99263
 
16.4%
89.9768
 
4.2%
89.9844
 
2.7%
89.9210
 
0.6%
89.964
 
0.2%
89.873
 
0.2%
89.72
 
0.1%
89.892
 
0.1%
ValueCountFrequency (%)
0351
21.9%
89.72
 
0.1%
89.873
 
0.2%
89.892
 
0.1%
89.9210
 
0.6%
ValueCountFrequency (%)
90858
53.5%
89.99263
 
16.4%
89.9844
 
2.7%
89.9768
 
4.2%
89.964
 
0.2%

slopeclass
Real number (ℝ≥0)

Distinct12
Distinct (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.440706127
Minimum1
Maximum8
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q35
95-th percentile6
Maximum8
Range7
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.544851405
Coefficient of variation (CV)0.4489925463
Kurtosis-0.5265035054
Mean3.440706127
Median Absolute Deviation (MAD)1
Skewness0.3523611714
Sum5522.333333
Variance2.386565863
MonotocityNot monotonic
Histogram with fixed size bins (bins=12)
ValueCountFrequency (%)
3390
24.3%
2374
23.3%
5347
21.6%
4211
13.1%
1142
 
8.8%
6103
 
6.4%
816
 
1.0%
715
 
0.9%
5.3333333333
 
0.2%
4.6666666672
 
0.1%
Other values (2)2
 
0.1%
ValueCountFrequency (%)
1142
 
8.8%
1.3333333331
 
0.1%
2374
23.3%
3390
24.3%
4211
13.1%
ValueCountFrequency (%)
816
 
1.0%
715
 
0.9%
6103
6.4%
5.6666666671
 
0.1%
5.3333333333
 
0.2%

aspect
Real number (ℝ≥0)

ZEROS

Distinct196
Distinct (%)12.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean173.005215
Minimum0
Maximum360
Zeros24
Zeros (%)1.5%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum0
5-th percentile6.77
Q186.63
median159.78
Q3273.37
95-th percentile341.57
Maximum360
Range360
Interquartile range (IQR)186.74

Descriptive statistics

Standard deviation107.693652
Coefficient of variation (CV)0.6224878943
Kurtosis-1.194809579
Mean173.005215
Median Absolute Deviation (MAD)98.66
Skewness0.04388824022
Sum277673.37
Variance11597.92267
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
302.09108
 
6.7%
143.3766
 
4.1%
6.7756
 
3.5%
273.3748
 
3.0%
22.8345
 
2.8%
329.0444
 
2.7%
27031
 
1.9%
261.1630
 
1.9%
13528
 
1.7%
155.8527
 
1.7%
Other values (186)1122
69.9%
ValueCountFrequency (%)
024
1.5%
3.375
 
0.3%
3.742
 
0.1%
4.0912
0.7%
4.7612
0.7%
ValueCountFrequency (%)
3609
0.6%
354.472
 
0.1%
354.093
 
0.2%
352.971
 
0.1%
352.414
0.2%

hillshade
Categorical

Distinct3
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.7 KiB
0.0
961 
158.33
409 
42.42
235 

Length

Max length6
Median length3
Mean length4.057320872
Min length3

Characters and Unicode

Total characters6512
Distinct characters8
Distinct categories2 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row158.33
2nd row158.33
3rd row158.33
4th row158.33
5th row158.33
ValueCountFrequency (%)
0.0961
59.9%
158.33409
25.5%
42.42235
 
14.6%
Histogram of lengths of the category
ValueCountFrequency (%)
0.0961
59.9%
158.33409
25.5%
42.42235
 
14.6%

Most occurring characters

ValueCountFrequency (%)
01922
29.5%
.1605
24.6%
3818
12.6%
4470
 
7.2%
2470
 
7.2%
1409
 
6.3%
5409
 
6.3%
8409
 
6.3%

Most occurring categories

ValueCountFrequency (%)
Decimal Number4907
75.4%
Other Punctuation1605
 
24.6%

Most frequent character per category

ValueCountFrequency (%)
01922
39.2%
3818
16.7%
4470
 
9.6%
2470
 
9.6%
1409
 
8.3%
5409
 
8.3%
8409
 
8.3%
ValueCountFrequency (%)
.1605
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common6512
100.0%

Most frequent character per script

ValueCountFrequency (%)
01922
29.5%
.1605
24.6%
3818
12.6%
4470
 
7.2%
2470
 
7.2%
1409
 
6.3%
5409
 
6.3%
8409
 
6.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII6512
100.0%

Most frequent character per block

ValueCountFrequency (%)
01922
29.5%
.1605
24.6%
3818
12.6%
4470
 
7.2%
2470
 
7.2%
1409
 
6.3%
5409
 
6.3%
8409
 
6.3%

rougness
Real number (ℝ≥0)

Distinct200
Distinct (%)12.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35.14402077
Minimum0
Maximum315.63
Zeros8
Zeros (%)0.5%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum0
5-th percentile4.36
Q110.05
median20.76
Q346.1
95-th percentile114.18
Maximum315.63
Range315.63
Interquartile range (IQR)36.05

Descriptive statistics

Standard deviation42.34846696
Coefficient of variation (CV)1.204997779
Kurtosis9.58259801
Mean35.14402077
Median Absolute Deviation (MAD)13.28
Skewness2.706988334
Sum56406.15333
Variance1793.392654
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
46.1108
 
6.7%
22.4366
 
4.1%
5.4861
 
3.8%
104.3556
 
3.5%
11.3650
 
3.1%
1648
 
3.0%
29.2245
 
2.8%
24.0430
 
1.9%
19.0327
 
1.7%
10.7727
 
1.7%
Other values (190)1087
67.7%
ValueCountFrequency (%)
08
0.5%
1.418
0.5%
29
0.6%
2.452
 
0.1%
2.834
0.2%
ValueCountFrequency (%)
315.632
 
0.1%
292.554
 
0.2%
242.823
 
0.2%
222.1916
1.0%
189.992
 
0.1%

cwd
Real number (ℝ)

ZEROS

Distinct209
Distinct (%)13.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-384.1270218
Minimum-1451.29
Maximum0
Zeros23
Zeros (%)1.4%
Negative1582
Negative (%)98.6%
Memory size12.7 KiB

Quantile statistics

Minimum-1451.29
5-th percentile-1108.83
Q1-507.33
median-256
Q3-138.71
95-th percentile-30.46
Maximum0
Range1451.29
Interquartile range (IQR)368.62

Descriptive statistics

Standard deviation346.9062657
Coefficient of variation (CV)-0.9031029997
Kurtosis0.1909949419
Mean-384.1270218
Median Absolute Deviation (MAD)180.58
Skewness-1.102328575
Sum-616523.87
Variance120343.9572
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-30.46108
 
6.7%
-198.5874
 
4.6%
-1108.8366
 
4.1%
-69.4254
 
3.4%
-507.3348
 
3.0%
-927.0442
 
2.6%
-18632
 
2.0%
-189.7130
 
1.9%
-405.2927
 
1.7%
-255.4224
 
1.5%
Other values (199)1100
68.5%
ValueCountFrequency (%)
-1451.2912
0.7%
-1389.582
 
0.1%
-1275.5412
0.7%
-1248.758
0.5%
-1174.544
 
0.2%
ValueCountFrequency (%)
023
1.4%
-0.52
 
0.1%
-10.6711
0.7%
-11.51
 
0.1%
-16.048
 
0.5%

pt10
Real number (ℝ≥0)

Distinct218
Distinct (%)13.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean42.36081828
Minimum0.01
Maximum131.07
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum0.01
5-th percentile15.09
Q130.79
median40.97
Q350.93
95-th percentile85.01
Maximum131.07
Range131.06
Interquartile range (IQR)20.14

Descriptive statistics

Standard deviation18.93595019
Coefficient of variation (CV)0.4470156847
Kurtosis0.884345287
Mean42.36081828
Median Absolute Deviation (MAD)10.18
Skewness0.7080079546
Sum67989.11333
Variance358.5702094
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
85.01108
 
6.7%
15.0966
 
4.1%
38.0761
 
3.8%
43.9648
 
3.0%
41.8645
 
2.8%
21.6542
 
2.6%
62.7130
 
1.9%
47.7327
 
1.7%
30.7927
 
1.7%
37.8424
 
1.5%
Other values (208)1127
70.2%
ValueCountFrequency (%)
0.018
0.5%
5.2812
0.7%
7.082
 
0.1%
8.432
 
0.1%
12.9512
0.7%
ValueCountFrequency (%)
131.072
 
0.1%
102.711
 
0.1%
101.932
 
0.1%
101.043
0.2%
92.46
0.4%

avg_percipitation
Real number (ℝ≥0)

Distinct129
Distinct (%)8.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean110.2801661
Minimum0
Maximum370
Zeros12
Zeros (%)0.7%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum0
5-th percentile12
Q139
median64
Q3191
95-th percentile288
Maximum370
Range370
Interquartile range (IQR)152

Descriptive statistics

Standard deviation97.32284784
Coefficient of variation (CV)0.8825054517
Kurtosis-0.5073600095
Mean110.2801661
Median Absolute Deviation (MAD)44
Skewness0.928165467
Sum176999.6667
Variance9471.736712
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
191108
 
6.7%
1887
 
5.4%
6269
 
4.3%
3460
 
3.7%
5452
 
3.2%
27248
 
3.0%
4945
 
2.8%
28838
 
2.4%
4236
 
2.2%
6533
 
2.1%
Other values (119)1029
64.1%
ValueCountFrequency (%)
012
0.7%
112
0.7%
34
 
0.2%
49
0.6%
57
0.4%
ValueCountFrequency (%)
37015
0.9%
3294
 
0.2%
32418
1.1%
3222
 
0.1%
30515
0.9%

bd_avg
Real number (ℝ≥0)

Distinct205
Distinct (%)12.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1303.795202
Minimum688
Maximum1512
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum688
5-th percentile1061.5
Q11232.67
median1322.33
Q31379.5
95-th percentile1472.5
Maximum1512
Range824
Interquartile range (IQR)146.83

Descriptive statistics

Standard deviation122.8896505
Coefficient of variation (CV)0.09425533267
Kurtosis1.688913801
Mean1303.795202
Median Absolute Deviation (MAD)61.67
Skewness-1.007073387
Sum2092591.3
Variance15101.86621
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1202.33108
 
6.7%
1491.566
 
4.1%
1471.6758
 
3.6%
129356
 
3.5%
131948
 
3.0%
128645
 
2.8%
1379.535
 
2.2%
138334
 
2.1%
1362.530
 
1.9%
130827
 
1.7%
Other values (195)1098
68.4%
ValueCountFrequency (%)
6884
0.2%
845.51
 
0.1%
909.671
 
0.1%
929.55
0.3%
944.53
0.2%
ValueCountFrequency (%)
15124
 
0.2%
1508.58
 
0.5%
1491.566
4.1%
14912
 
0.1%
1472.512
 
0.7%

ph_avg
Real number (ℝ≥0)

Distinct116
Distinct (%)7.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean61.14003115
Minimum45
Maximum84
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum45
5-th percentile50.5
Q154.5
median59.5
Q366
95-th percentile77.45
Maximum84
Range39
Interquartile range (IQR)11.5

Descriptive statistics

Standard deviation8.384852445
Coefficient of variation (CV)0.1371417758
Kurtosis-0.5611486727
Mean61.14003115
Median Absolute Deviation (MAD)5.25
Skewness0.5176320428
Sum98129.75
Variance70.30575052
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
54.5115
 
7.2%
64.7575
 
4.7%
57.2568
 
4.2%
62.7568
 
4.2%
7067
 
4.2%
5266
 
4.1%
50.551
 
3.2%
5350
 
3.1%
64.2543
 
2.7%
5442
 
2.6%
Other values (106)960
59.8%
ValueCountFrequency (%)
4510
0.6%
45.251
 
0.1%
45.7518
1.1%
461
 
0.1%
475
 
0.3%
ValueCountFrequency (%)
842
 
0.1%
80.255
 
0.3%
79.758
0.5%
79.512
0.7%
79.2514
0.9%

cat_avg
Real number (ℝ≥0)

Distinct91
Distinct (%)5.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean17.62056075
Minimum5.25
Maximum50.5
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum5.25
5-th percentile7
Q111.75
median18
Q321.5
95-th percentile28.05
Maximum50.5
Range45.25
Interquartile range (IQR)9.75

Descriptive statistics

Standard deviation6.925355805
Coefficient of variation (CV)0.3930269816
Kurtosis1.791071236
Mean17.62056075
Median Absolute Deviation (MAD)4.75
Skewness0.7545410865
Sum28281
Variance47.96055302
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
19.5128
 
8.0%
13.7588
 
5.5%
17.580
 
5.0%
11.7569
 
4.3%
1164
 
4.0%
23.562
 
3.9%
21.559
 
3.7%
1859
 
3.7%
7.7545
 
2.8%
20.7540
 
2.5%
Other values (81)911
56.8%
ValueCountFrequency (%)
5.2512
 
0.7%
6.256
 
0.4%
6.534
2.1%
6.7518
1.1%
715
0.9%
ValueCountFrequency (%)
50.54
 
0.2%
476
 
0.4%
41.751
 
0.1%
36.7518
1.1%
35.55
 
0.3%

clay_avg
Real number (ℝ≥0)

Distinct102
Distinct (%)6.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25.93650052
Minimum2.5
Maximum44
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum2.5
5-th percentile12.75
Q120.5
median26.25
Q330.25
95-th percentile38
Maximum44
Range41.5
Interquartile range (IQR)9.75

Descriptive statistics

Standard deviation7.672610514
Coefficient of variation (CV)0.295822889
Kurtosis-0.01006382244
Mean25.93650052
Median Absolute Deviation (MAD)4.5
Skewness-0.003036648845
Sum41628.08333
Variance58.86895211
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
30.25110
 
6.9%
22.2579
 
4.9%
29.7563
 
3.9%
16.559
 
3.7%
35.558
 
3.6%
20.556
 
3.5%
2852
 
3.2%
2650
 
3.1%
4448
 
3.0%
27.7548
 
3.0%
Other values (92)982
61.2%
ValueCountFrequency (%)
2.52
 
0.1%
46
0.4%
6.251
 
0.1%
6.751
 
0.1%
7.513
0.8%
ValueCountFrequency (%)
4448
3.0%
42.251
 
0.1%
41.252
 
0.1%
39.753
 
0.2%
39.254
 
0.2%

slit_avg
Real number (ℝ≥0)

Distinct142
Distinct (%)8.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31.03161994
Minimum6.75
Maximum64.25
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum6.75
5-th percentile13
Q119.5
median32.25
Q341.5
95-th percentile53.25
Maximum64.25
Range57.5
Interquartile range (IQR)22

Descriptive statistics

Standard deviation13.1474565
Coefficient of variation (CV)0.4236793478
Kurtosis-0.9641516511
Mean31.03161994
Median Absolute Deviation (MAD)10.5
Skewness0.1958998105
Sum49805.75
Variance172.8556124
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
19.5124
 
7.7%
35.75117
 
7.3%
13.7566
 
4.1%
41.562
 
3.9%
47.547
 
2.9%
22.546
 
2.9%
15.544
 
2.7%
1342
 
2.6%
17.2527
 
1.7%
34.7524
 
1.5%
Other values (132)1006
62.7%
ValueCountFrequency (%)
6.754
0.2%
89
0.6%
9.52
 
0.1%
9.756
0.4%
10.583333331
 
0.1%
ValueCountFrequency (%)
64.252
 
0.1%
631
 
0.1%
622
 
0.1%
59.753
0.2%
59.255
0.3%

sand_avg
Real number (ℝ≥0)

Distinct136
Distinct (%)8.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean43.03187954
Minimum9.5
Maximum82.75
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum9.5
5-th percentile21.25
Q134
median42
Q354.25
95-th percentile64.75
Maximum82.75
Range73.25
Interquartile range (IQR)20.25

Descriptive statistics

Standard deviation13.84781531
Coefficient of variation (CV)0.3218036361
Kurtosis-0.3736465888
Mean43.03187954
Median Absolute Deviation (MAD)9.25
Skewness0.2337130123
Sum69066.16667
Variance191.7619887
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
34119
 
7.4%
4575
 
4.7%
6466
 
4.1%
4261
 
3.8%
24.560
 
3.7%
36.554
 
3.4%
54.7553
 
3.3%
27.539
 
2.4%
5937
 
2.3%
40.535
 
2.2%
Other values (126)1006
62.7%
ValueCountFrequency (%)
9.51
 
0.1%
14.752
 
0.1%
15.2514
0.9%
15.7523
1.4%
164
 
0.2%
ValueCountFrequency (%)
82.756
0.4%
80.52
 
0.1%
79.254
 
0.2%
77.54
 
0.2%
75.2512
0.7%

gridsoc_g_kg_avg
Real number (ℝ≥0)

Distinct157
Distinct (%)9.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean33.06370717
Minimum3.75
Maximum164.75
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size12.7 KiB

Quantile statistics

Minimum3.75
5-th percentile7.75
Q115
median29.5
Q348
95-th percentile76.5
Maximum164.75
Range161
Interquartile range (IQR)33

Descriptive statistics

Standard deviation23.68881271
Coefficient of variation (CV)0.7164596696
Kurtosis5.4698822
Mean33.06370717
Median Absolute Deviation (MAD)14.83333333
Skewness1.792597784
Sum53067.25
Variance561.1598477
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
48.75108
 
6.7%
7.7578
 
4.9%
16.562
 
3.9%
16.2560
 
3.7%
43.556
 
3.5%
14.548
 
3.0%
37.7545
 
2.8%
30.2537
 
2.3%
3732
 
2.0%
9.531
 
1.9%
Other values (147)1048
65.3%
ValueCountFrequency (%)
3.752
 
0.1%
4.524
 
1.5%
7.7578
4.9%
82
 
0.1%
8.2512
 
0.7%
ValueCountFrequency (%)
164.755
0.3%
162.51
 
0.1%
1582
 
0.1%
1484
0.2%
133.52
 
0.1%

soc_change_mg
Real number (ℝ)

Distinct1021
Distinct (%)63.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-1.266650052
Minimum-490
Maximum324
Zeros12
Zeros (%)0.7%
Negative636
Negative (%)39.6%
Memory size12.7 KiB

Quantile statistics

Minimum-490
5-th percentile-31.782
Q1-3.6
median1.7
Q38.69
95-th percentile53.19333333
Maximum324
Range814
Interquartile range (IQR)12.29

Descriptive statistics

Standard deviation54.52941232
Coefficient of variation (CV)-43.05010073
Kurtosis27.8776923
Mean-1.266650052
Median Absolute Deviation (MAD)6.1
Skewness-4.299485809
Sum-2032.973333
Variance2973.456808
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
48.3842
 
2.6%
23.4033333336
 
2.2%
-317.229
 
1.8%
53.2666666718
 
1.1%
-68.3333333317
 
1.1%
4.2415
 
0.9%
64.8466666714
 
0.9%
-4.36666666714
 
0.9%
0.286666666712
 
0.7%
012
 
0.7%
Other values (1011)1396
87.0%
ValueCountFrequency (%)
-4901
 
0.1%
-4301
 
0.1%
-3251
 
0.1%
-317.229
1.8%
-316.96666671
 
0.1%
ValueCountFrequency (%)
3241
0.1%
294.11
0.1%
1651
0.1%
140.851
0.1%
132.691
0.1%

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

Unnamed: 0temp_cprecipitation_mmdepth_midpoint_previousdepth_midpoint_currentsoc_g_kg_previoussoc_mg_ha_previouslatitudlongitudelevationslopeslopeclassaspecthillshaderougnesscwdpt10avg_percipitationbd_avgph_avgcat_avgclay_avgslit_avgsand_avggridsoc_g_kg_avgsoc_change_mg
0021.01500.05.05.048.39333317.39-21.35-47.07622.090.03.0329.53158.3333.97-320.3345.65261.01347.552.011.035.519.545.016.53.31
1121.01500.015.015.043.58000015.57-21.35-47.07622.090.03.0329.53158.3333.97-320.3345.65261.01347.552.011.035.519.545.016.5-3.03
2221.01500.040.040.043.58000055.38-21.35-47.07622.090.03.0329.53158.3333.97-320.3345.65261.01347.552.011.035.519.545.016.5-24.46
3321.01500.080.080.036.72000036.10-21.35-47.07622.090.03.0329.53158.3333.97-320.3345.65261.01347.552.011.035.519.545.016.50.13
4421.01500.05.05.048.39333316.28-21.35-47.07622.090.03.0329.53158.3333.97-320.3345.65261.01347.552.011.035.519.545.016.50.15
5521.01500.015.015.043.58000018.26-21.35-47.07622.090.03.0329.53158.3333.97-320.3345.65261.01347.552.011.035.519.545.016.5-3.75
6621.01500.040.040.043.58000063.95-21.35-47.07622.090.03.0329.53158.3333.97-320.3345.65261.01347.552.011.035.519.545.016.5-21.25
7721.01500.080.080.036.72000049.20-21.35-47.07622.090.03.0329.53158.3333.97-320.3345.65261.01347.552.011.035.519.545.016.5-9.78
8821.01500.05.05.048.39333313.59-21.35-47.07622.090.03.0329.53158.3333.97-320.3345.65261.01347.552.011.035.519.545.016.51.01
9921.01500.015.015.043.58000015.33-21.35-47.07622.090.03.0329.53158.3333.97-320.3345.65261.01347.552.011.035.519.545.016.50.63

Last rows

Unnamed: 0temp_cprecipitation_mmdepth_midpoint_previousdepth_midpoint_currentsoc_g_kg_previoussoc_mg_ha_previouslatitudlongitudelevationslopeslopeclassaspecthillshaderougnesscwdpt10avg_percipitationbd_avgph_avgcat_avgclay_avgslit_avgsand_avggridsoc_g_kg_avgsoc_change_mg
1595159524.0735.0125.0125.04.20021.20000029.7276.97253.090.02.0329.04158.335.48-927.0421.6534.01471.6752.011.0035.5019.5045.0016.25-6.800000
1596159624.0735.0175.0175.04.20014.10000029.7276.97253.090.02.0329.04158.335.48-927.0421.6534.01471.6752.011.0035.5019.5045.0016.250.800000
1597159722.31166.07.57.519.26091.47666721.54101.16587.090.06.0258.9342.42155.94-316.580.0111.01299.6753.518.0031.5030.7537.7520.758.690000
1598159822.31166.022.522.518.20091.47666721.54101.16587.090.06.0258.9342.42155.94-316.580.0111.01299.6753.518.0031.5030.7537.7520.758.690000
1599159922.31166.037.537.516.06091.47666721.54101.16587.090.06.0258.9342.42155.94-316.580.0111.01299.6753.518.0031.5030.7537.7520.758.690000
1600160022.31166.052.552.512.700113.72666721.54101.16587.090.06.0258.9342.42155.94-316.580.0111.01299.6753.518.0031.5030.7537.7520.7525.366667
1601160122.31166.07.57.511.85029.56666721.56101.14795.090.06.066.630.00292.55-316.580.0112.01250.0053.520.7532.7529.7537.5021.75-8.133333
1602160222.31166.022.522.57.98829.56666721.56101.14795.090.06.066.630.00292.55-316.580.0112.01250.0053.520.7532.7529.7537.5021.75-8.133333
1603160322.31166.037.537.55.42629.56666721.56101.14795.090.06.066.630.00292.55-316.580.0112.01250.0053.520.7532.7529.7537.5021.75-8.133333
1604160422.31166.052.552.53.26429.56666721.56101.14795.090.06.066.630.00292.55-316.580.0112.01250.0053.520.7532.7529.7537.5021.75-8.133333